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Abstract. We study the following problem: preprocess a set O of objects 
into a data structure that allows us to efficiently report all pairs of 
objects from O that intersect inside an axis-aligned query range Q. 
We present data structures of size 0(n polylog n) and with query time 
0((k + 1) polylog n) time, where k is the number of reported pairs, for two 
classes of objects in the plane: axis-aligned rectangles and objects with 
small union complexity. For the 3-dimensional case where the objects and 
the query range are axis-aligned boxes in R 3 , we present a data structures 
of size 0(riy/n polylog n) and query time 0((^/n + k ) polylogn). When 
the objects and query are fat, we obtain 0((k + 1) polylog n) query time 
using 0(n polylogn) storage. 


1 Introduction 

The study of geometric data structures is an important subarea within com¬ 
putational geometry, and range queries form one of the most widely studied 
topics within this area cun- In a range query, the goal is to report or count 
all points from a given set O that lie inside a query range Q. The more general 
version, where O contains other objects than just points and the goal is to report 
all objects intersecting Q, is often called intersection searching and it has been 
studied extensively as well. 

A common characteristic of the range-searching and intersection-searching 
problems studied so far, is that whether an object Oi £ O should be reported (or 
counted) depends only on Oi and Q. In this paper we study a range-searching 
variant where we are interested in reporting pairs of objects that satisfy a certain 
criterion. In particular, we want to preprocess a set O = {cq,..., o n } of n objects 
in the plane such that, given a query range Q , we can efficiently report all pairs 
of objects Oi, Oj that intersect inside Q. An obvious approach is to precompute 
all intersections between the objects and store the intersections in a suitable 
intersection-searching data structure. This may give fast query times, but in the 
worst case any two objects intersect, so l?(n 2 ) is a lower bound on the storage 
for this approach. The main question is thus: can we achieve fast query times 
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with a data structure that uses subquadratic (and preferably near-linear) storage 
in the worst case? 

We answer this question affirmatively when Q is an axis-aligned rectangle 
in the plane and the objects are either axis-aligned rectangles or objects with 
small union complexity. For axis-aligned rectangles our data structure uses 
0(n log n) storage and has 0((k + l)lognlog*n) query timej^j where k is the 
number of reported pairs of objects. Our data structure for classes of objects with 
small union complexity—disks and other types of fat objects are examples—uses 
0(U(n) log n) storage, where U(n ) is maximum union complexity of n objects 
from the given class, and it has 0{(k +1) log 2 n) query time. We also consider a 3- 
dimensional extension of the planar case, where the range Q and the objects in O 
are axis-aligned boxes. Our data structures for this setting has size 0(jiy/n log n) 
and query time 0((k + 1) log 2 log* n). For the special case where the query range 
and the objects are fat, we present a data structure of 0(n log 2 n) size and 
0((k + 1) log 2 n log* n) query time. 

2 Axis-aligned objects 

In this section we study the case where the set O is a set of n axis-aligned 
rectangles in the plane or boxes in R 3 . Our approach for these cases is the same 
and uses the following two-step query process. 

1. Compute a seed set 0*(Q) C O of objects such that the following holds: for 
any two objects Oi,Oj in O such that Oi and Oj intersect inside Q, at least 
one of Oi,Oj is in 0*(Q). 

2. For each seed object Oi G 0*(Q), perform an intersection query with the 
range Oj 0 Q in the set O, to find all objects Oj ^ Oi intersecting Oj inside Q. 

To make this approach efficient, we need that the seed set 0*{Q) does not contain 
too many objects that do not give an answer in Step [2} For the planar case our 
seed set will satisfy \0*{Q)\ = 0(1 + k), where k denotes the number of pairs 
of objects in O that intersect inside Q , while for the 3-dimensional case we will 
have \0*(Q)\ = O^n + k). 

2.1 The planar case 

Axis-aligned segments. As a warm-up exercise we start with the case where 
O consists of axis-aligned segments. Let O = {si,..., s n } be a set of axis-aligned 
segments, and let V(O) and H(O) denote the set of vertical and horizontal 
segments in O, respectively. We assume for simplicity that we are only interested 
in intersections between horizontal and vertical segments; the solution can easily 
be adapted to the case where we also want to report intersections between two 
horizontal (or two vertical) segments. 

3 Here log* n denotes the iterated logarithm. 
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The key to our approach is to be able to efficiently find the seed set 0*{Q). 
To this end, during the preprocessing we compute an 0(ro)-sized subset W of 
the intersection points in O. We call intersection points in W witnesses. The 
witness set W is defined as follows: for each line segment .s t £ V((D) we put the 
topmost and bottommost intersection points of s* with a segment from H (O) (if 
any) into W; for each line segment s t £ H (O) we put the leftmost and rightmost 
intersection points of Sj with a segment from V(O) (if any) into W. Since we 
take at most two witness points for each line segment, the size of W is clearly at 
most 2 n. 

Our data structure to find the seed set 0*(Q) now consists of three compo¬ 
nents: First, we store W in a data structure T>± for 2-dimensional orthogonal 
range reporting. Second, we store V(0) in a data structure V 2 that allows us 
to decide if there are any segments that completely cross the query rectangle Q 
from top to bottom, and that can report all such segments. Third, we store H (O) 
in a data structure T > 3 that allows us to decide if there are any segments that 
completely cross the query rectangle Q from left to right. 

Step [l] of the query procedure, where we compute 0*(Q), proceeds as follows. 

l(i) Perform a query in T> i to find all witness points inside Q. For each reported 
witness point, insert the corresponding segment into 0*{Q). 

1 (ii) Perform queries in T > 2 and V 3 to decide if the number of segments crossing 
Q completely from top to bottom, and the number of segments crossing Q 
completely from left to right, are both non-zero. If so, report all segments 
crossing completely from top to bottom, and put them into 0*(Q). 

Lemma 1. Let Si , Sj be two segments in O such that Si D Sj £ Q. Then at least 
one of Si,Sj is put into 0*(Q) by the above query procedure. 

Proof. If Si crosses Q completely from left to right and Sj crosses Q completely 
from top to bottom (or vice versa), then one of them will be put into 0*(Q) 
in Step 1 (ii). Otherwise at least one of the segments, say s,, has an endpoint v 
inside Q. But then the intersection point on Sj closest to v, which is a witness 
point, must lie inside Q. Hence, s, is put into 0*(Q) in Step l(i). □ 

In Step [2] of the query procedure we need to report, for each segment s, in the 
seed set 0*(Q ), the segments Sj £ O intersecting Sj O Q. Thus we store O in a 
data structure T >4 that can report all segments intersecting an axis-aligned query 
segment. Putting everything together we obtain the following theorem. 

Theorem 1. Let O be a set of n axis-aligned segments in the plane. Then there is 
a data structure that uses Ofjilogn) storage and can report, for any axis-aligned 
query rectangle Q, all pairs of segments Si,Sj in O such that s, intersects Sj 
inside Q in 0((k + 1) log n log* n) time, where k denotes the number of answers. 

Proof. For the data structure T>\ on the set W we can take a standard 2- 
dimensional range tree [3], which uses 0(n log n) storage. If we apply fractional 
cascading [3], reporting the witness points inside Q takes 0(logn + ^answers) 
time. For T >2 (and, similarly, V 3 ) we note that a vertical segment Si := Xi x [j/j, y'f\ 
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crosses Q := [xq,x'q] x [j/q,j/q] if and only if the point (aq, y t . y') lies in the 
range [xq,x'q] x [—oo,j/q] x [j/q,oo]. Hence, we can use the data structure 
of Subramanian and Ramaswamy [l3j . which uses O(nlogn) storage and has 
0(lognlog* n + ^answers) query time. Hence, the supporting data structures 
for Step [I] use 0(n log n) storage, and finding the seed set takes 0(log?rlog* n + 
\0*(Q)\) time. 

It remains to analyze Step [ 2 ] of the query procedure. First notice that the 
problem of finding for a given s 7 ; £ 0*(Q) all Sj £ O such that SiHQ intersects 
Sj, is the same range-searching problem as Step 1 (ii), except that the query range 
is a line segment this time. Hence, we again transform the problem to a 3D 
range-searching problem on points and use the data structure of Subramanian and 
Ramaswamy [13]. Thus the running time of Step[2lis Yl 3i eO*(Q) 0(l°g log* n + ki), 
where ki denotes the number of segments in (Urnat intersect inside Q. Since 
|£>*(< 2 )| ^ 2 k where k is the total number of reported pairs—each segment in 
0*(Q) intersects at least one other segment inside Q and for every reported 
pair we put at most two segments into the seed set—the time for Step [2] is 
O(|0*(Q)| lognlog* n + k) = 0 {(k + 1) lognlog* n). □ 


Axis-aligned rectangles. We now extend our approach to axis-aligned rectan¬ 
gles. Let O = {ri,..., r n } be a set of axis-aligned rectangles in the plane. Similar 
to the case of axis-aligned segments we need to find the seed set O* ( Q ) efficiently. 
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As before, we first define a witness set W. The 
witnesses in W are now axis-aligned segments rather 
than just points. For each rectangle £ O we define 
at most ten witness segments, two for each edge of r, 
and two in the interior of r.j. as follows—see also Fig. [T] 

Let e be an edge of j-j, and consider the set 5(e) := 
efl (U j&rj), that is, the part of e covered by the other 
rectangles. The set 5(e) consists of a number of sub¬ 
edges of e. If e is vertical then we add the topmost and 
bottommost sub-edge from 5(e) (if any) to W; if e is 
horizontal we add the leftmost and rightmost sub-edge 
to W. The two witness segments in the interior of r, 
are defined as follows. Suppose there are vertical edges 

(belonging to other rectangles r ? ) completely crossing r, from top to bottom. 
Then we put e'rwy into W, where e! is the rightmost such crossing edge. Similarly, 
we put into W the topmost horizontal edge e" completely crossing r, from left to 
right. Our data structure to find the seed set C>*(<2) now consists of the following 
components. 


Fig. 1 : Gray areas are in¬ 
tersections with rj, black 
segments indicate wit¬ 
ness segments. 


— We store the witness set W in a data structure V 1 that allows us to report 
the set of segments that intersect the query rectangle Q. 

— We store the vertical edges of the rectangles in O in a data structure T> 2 that 
allows us to decide if the set V(Q) of edges that completely cross a query 
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rectangle Q from top to bottom, is non-empty. The data structure should 
also be able to report all (rectangles corresponding to) the edges in V(Q). 

— We store the horizontal edges of the rectangles in O in a data structure V 3 
that allows us to decide if the set H (Q) of edges that completely cross a query 
rectangle Q from left to right, is non-empty. 

— We store O in a data structure V 4 that allows us to report the set of rectangles 
that contain a query point q. 

Step [I] of the query procedure, where we compute 0*(Q), proceeds as follows. 

l(i) Perform a query in T >i to find all witness segments intersecting Q. For each 
reported witness segment, insert the corresponding rectangle into 0*(Q). 

1 (ii) Perform queries in V 2 and V 3 to decide if the sets V(Q) and H(Q) are both 
non-empty. If so, report all rectangles corresponding to edges in V(Q) and 
put them into 0*(Q). 

l(iii) For each corner point q of Q, perform a query in T >4 to report all rectangles 
in O that contain q, and put them into 0*(Q). 

The next lemma can be proved using a case analysis—see the Appendix [A] 

Lemma 2. Let r,, Tj be two rectangles in O such that ( 7 ^ D rj) f~l Q ^ 0. Then 
at least one of ri,Vj is put into 0*(Q) by the above query procedure. 

In the second part of the query procedure we need to report, for each rectangle r ; ; 
in the seed set 0*{Q ), the rectangles rj £ O intersecting r t n Q. Thus we store 
O in a data structure T> 5 that can report all rectangles intersecting a query 
rectangle. Putting everything together we obtain the following theorem. 

Theorem 2. Let O be a set of n axis-aligned rectangles in the plane. There is a 
data structure that uses 0 (n\ogn) storage and can report, for any axis-aligned 
query rectangle Q, all pairs of rectangles ri,rj in O such that intersects rj 
inside Q in 0{(k + 1) log n log* n) time, where k denotes the number of answers. 

Proof. For the data structure D\ on the set W we use the data structure 
developed by Edelsbrunner et al. [2], which uses 0(n log n) preprocessing time 
and storage, and has 0 (logn + ^answers) query time. 

Data structure V 2 (and, similarly, V 3 ) answers the same type of query we 
needed when O contains segments. Hence, we can use the same data structure m 
which uses 0 (n log n) space and has 0 (log?zlog*n + ^answers) query time. 
For data structure V 4 we use the point-enclosure data structure developed by 
Chazellc j4j, which uses 0{n) storage and can be used to report all rectangles in 
O containing a query point in Oflogn + ^answers) time. 

The analysis of Step [2] is similar to the analysis for the case of axis-aligned 
segments, except that we now have \(D*(Q)\ ^ 2fc + 4, where k is the total number 
of pairs of rectangles that will be reported; the extra term “+4” is because in 
Step [ljiii) we may report at most one rectangle per corner of Q that does not 
have an intersection inside Q. Again, finding the rectangles in O intersecting 
riCiQ, for a given 7^ € 0*(Q), can be done in 0(lognlog* n + ff answers), leading 
to an overall query time of 0 {{k + 1 ) log n log* n). □ 
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2.2 The 3-dimensional case 

We now study the case where the set O of objects and the query range Q are 
axis-aligned boxes in R 3 . We first present a solution for the general case, and 
then an improved solution for the special case where the input as well as the 
query are cubes. Both solutions use the same query strategy as above: we first 
find a seed set 0*(Q) that contains at least one object o* from every pair that 
intersects inside Q and then we find all other objects intersecting oi inside Q. 

The general case. Let O := {bi ,..., b n } be a set of axis-aligned boxes. The 
pairs of boxes bi,bj intersecting inside Q come in three types: (i) bi D bj fully 
contains Q, (ii) bi n bj lies completely inside Q , (iii) bi n bj intersects a face of Q. 

Type (i) is easy to handle without using seeds sets: we simply store O in a 
data structure for 3-dimensional point-enclosure queries [3J, which allows us to 
report all boxes bi £ O containing a query point in 0( log 2 n + ^answers) time. 
If we query this structure with a corner q of Q and report all pairs of boxes 
containing q then we have found all intersecting pairs of Type (i). 

Lemma 3. We can find all intersecting pairs of boxes of Type (i) in 0{ log 2 n + k) 
time, where k is the number of such pairs, with a structure of size O(nlogn). 

For Type (ii) we proceed as follows. Note that a vertex of bi D bj is either a 
vertex of bi or bj , or it is the intersection of an edge e of one of these two boxes 
and a face / of the other box. To handle the first case we create a set W of 
witness points, which contains for each box bi all its vertices that are contained 
in at least one other box. We store W in a data structure for 3-dimensional 
orthogonal range reporting m- In the query phase we then query this data 
structure with Q, and put all boxes corresponding to the witness vertices inside Q 
into the seed set 0*(Q). For the second case we show next how to find the 
intersecting pairs e, / where e is a vertical edge (that is, parallel to the z-axis) 
and / is a horizontal face (that is, parallel to the cry-plane); the intersecting pairs 
with other orientations can be found in a similar way. 

Let E be the set of vertical edges of the boxes in O and let F be the set of 
horizontal faces. We sort F by ^-coordinate—we assume for simplicity that all 
^-coordinates of the faces are distinct—and partition F into 0(y/n) clusters: the 
cluster F\ contains the first y/n faces in the sorted order, the second cluster F 2 
contains the next y/n faces, and so on. We call the range between the minimum 
and maximum z-coordinate in a cluster its z-range. For each cluster F t we store, 
besides its z-range and the set F, itself, the following information. Let Ei C E 
be the subset of edges that intersect at least one face in F i: and let E, denote 
the set of points obtained by projecting the edges in Ei onto the cry-plane. We 
store Ei in a data structure D(Ei) for 2-dimensional orthogonal range reporting. 
Note that an edge e £ E intersects at least one face f £ Fi inside Q if and only 
if e £ Ei and e lies in Q , the projection of Q onto the cry-plane. 

A query with a box Q = [aq : x 2 ] x [yi : y 2 \ x [z\ : z 2 ] is now answered as 
follows. We first find the clusters F) and Fj whose z-range contains zi and z 2 , 
respectively, and we put (the boxes corresponding to) the faces in these clusters 
into the seed set 0*(Q). Next we perform, for each i < t < j, a query with the 
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projected range Q in the data structure V{Ei). For each of the reported points e 
we put the box corresponding to the edge e into the seed set 0*(Q). Finally, we 
remove any duplicates from the seed set. 

We obtain the following lemma, whose proof is in the Appendix [A] 

Lemma 4. Using a data structure of size 0(n^/nlogn) we can find in time 
0 (log n log* n + k) a seed set O* ( Q ) of 0(y/n + k ) boxes containing at least one 
box from every intersecting pair of Type (ii), where k is the number of such pairs. 

It remains to handle the Type (iii) pairs, in which bi fl bj intersects a face of Q. 
We describe how to find the pairs such that bi n bj intersects the bottom face of 
Q; the pairs intersecting the other faces can be found in a similar way. 

We first sort the ^-coordinates of the horizontal faces of the boxes in O. For 
1 ^ i ^ 2 y/n, let hi be a horizontal plane containing the iy/n- th horizontal 
face in the ordering. These planes partition R 3 into O(yfn) horizontal slabs 
JCo, • ■ ■, ^2 y/n+ 1 • We call a box b G O short for a slab Si if it has a horizontal 
face inside A). and we call it long if it completely crosses A,;. For each A,, we 
store the short boxes in a list. We store the projections of the long boxes onto the 
.Ty-plane in a data structure 'D(Ei) for the 2-dimensional version of the problem, 
namely the structure Theorem [2] 

A query with the bottom face of Q is now answered as follows. We first find 
the slab A, containing the face. We put all short boxes of A* into our seed set 
0*(Q). We then perform a query with Q , the projection of Q onto the xy- plane, 
in the data structure 2?(Ai). For each answer we get from this 2-dimensional 
query—that is, each pair of projections intersecting inside Q —we directly report 
the corresponding pair of long boxes. (There is no need to go through the seed 
set for these pairs.) This leads to the following lemma for the Type (iii) pairs. 

Lemma 5. Using a data structure of size 0(nyfn log n) we can find in time 
0(y/n + (k + l)log*nlogn) a seed set 0*(Q) of 0(y/n) boxes plus a collection 
B(Q) of pairs of boxes intersecting inside Q such that, for each pair of Type (iii) 
boxes, either at least one of these boxes is in 0*(Q) or bi, bj is a pair in B{Q). 

In the second step of our query procedure we need to be able to report all boxes 
bj € O intersecting a query box B of the form Q n bi, where bi G 0*(Q). Note 
that B and bj intersect if and only if their projections onto the 2 -axis intersect 
and their projections onto the xy- plane intersect. Hence, we can answer the 
queries with a data structure T>* whose main tree is a (hereditary) segment 
tree [B] and whose associated structures are the data structure of Subramanian 
and Ramaswamy m- This leads to a structure using 0(relog 2 n) storage and 
0 ( log 2 n log* n + #answers) query time. 

Putting everything together we obtain the following theorem. 

Theorem 3. Let O be a set of n axis-aligned boxes in R 3 . Then there is a data 
structure that uses 0(ny/n\ogn) storage and that allows us to report, for any 
axis-aligned query box Q, all pairs of boxes bi, bj in O such that bi intersects bj 
inside Q in 0(y/n + (k + 1) log 2 nlog* n) time, where k denotes the number of 
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Fat boxes. Next we obtain better bounds when the boxes in O and the query 
box Q are fat, that is, when their aspect ratio —the ratio between the length of 
the longest edge and the length of the shortest edge—is bounded by a constant a. 
First we consider the case of cubes. 

Let O := {ci, • • • , c„} be a set of n cubes in M 3 and let Q be the query cube. 
We compute a set W of witness points for each cube c,, as follows. Let e be an 
edge of Ci , and consider the set 5(e) := efl (Uj^jC,-), that is, the part of e covered 
by the other cubes. We put the two extreme points from 5(e)—in other words, 
the two points closest to the endpoints of e—into W. Similarly, we assign each 
face / of Ci at most four witness points, namely points from 5(/) := / fl (U j^iCj) 
that are extreme in the directions parallel to /. For example, if / is parallel to 
the xy-plane, then we take points of maximum and minimum ^’-coordinate in 
5(/) and points of maximum and minimum y-coordinate in 5(/) as witnesses. 
We store W in a data structure V\ for orthogonal range queries, and we store O 
in a data structure V 2 for point-enclosure queries. 

To compute 0*(Q) in the first phase of the query procedure, we query D\ to 
find all witness points inside Q and for each reported witness point, we insert the 
corresponding cube into 0*(Q). Furthermore, for each corner point q of Q, we 
query U 2 to find the cubes in O that contain q , and we put them into 0 *{Q). 

Lemma 6 . Let d, Cj be two cubes in O such that (Ci HCj) DQ 7 ^ 0. Then at least 
one of Ci,Cj is put into 0*(Q) by the above query procedure. 

Proof. Suppose Ci 0 Cj intersects Q, and assume without loss of generality that Cj 
is not larger than Cj. If c,; or Cj contains a corner q of Q then the corresponding 
cube will be put into the seed set when we perform a point-enclosure query 
with q, so assume Ci and Cj do not contain a corner. We have two cases. 

Case A: d does not intersect any edge of Q. Because Cj and Q are cubes, this 
implies that d is contained in Q or C; intersects exactly one face of Q. Assume 
that Cj intersects the bottom face of Q\ the cases where Cj intersects another face 
and where Cj is contained in Q can be handled similarly. We claim that at least 
one of the vertical faces of Cj contributes a witness point inside Q. To see this, 
observe that Cj will intersect at least one vertical face, /, of Cj inside Q 1 since Cj 
intersects Cj inside Q and Cj is not larger than Cj. Hence, the witness point on / 
with maximum 2 -coordinate will be inside Q. Thus Cj will be put into 0*(Q). 

Case B: Cj intersects one edge of Q. (If Cj intersects more than one edge of 
Q then it would contain a corner of Q.) Assume without loss of generality that 
Cj intersects the bottom edge of the front face of Q; see Fig. [2] Observe that if 
Cj intersects the top face of Cj then the witness point of the face with minimum 
^-coordinate is inside Q. Similarly, if Cj intersects the back face of Cj (the face 
parallel to the yz-plane and with minimum ^-coordinate) then the witness point 
of the face with maximum 2 -coordinate is inside Q. Otherwise, as illustrated in 
Fig[3j c,j must have an edge e parallel to the y-axis that intersects Cj inside Q , 
and one of the witness points on e will be inside Q —note that e lies fully inside 
Q because Cj does not contain a corner of Q. □ 
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z-axis 



Fig. 2: Case B in the proof of 
Lemma [6l Cj is not shown. 
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Cj 



Fig. 3: Cross-section of Q , c,-, and Cj 
with a plane parallel to the irz-plane. 
The gray area indicates Q (~l c,; in the 
cross-section. 


To adapt the above solution to boxes of aspect ratio at most a, we cover 
each box # £ O by 0(a 2 ) cubes, and preprocess the resulting collection O of 
cubes as described above, making sure we do not introduce witness points for 
pairs of cubes used in the covering of the same box bi. To perform a query, we 
cover Q by 0(a 2 ) query cubes and compute a seed set for each query cube. We 
take the union of these seed sets, replace the cubes from O in the seed set by the 
corresponding boxes in O , and filter out duplicates. This gives us our seed set 
0*(Q) for the second phase of the query procedure. 

In the second phase we take each bi £ O* ( Q) and report all bj £ O intersect¬ 
ing bi n Q , using the data structure V* 
the following theorem. 


described in Subsection 2.2 We obtain 


Theorem 4. Let O be a set ofn axis-aligned boxes in M 3 of aspect ratio at most a. 
Then there is a data structure that uses 0(a 2 n log 2 n) storage and that allows us 
to report, for any axis-aligned query box Q of aspect ratio at most a, all pairs of 
cubes Ci, Cj in O such that Ci intersects Cj inside Q in 0(a 2 (k + 1) log 2 log* n) 
time, where k denotes the number of answers. 


Proof. The data structures T>\ and V >2 can be implemented such that they use 
O(nlogn) storage, and have 0 (lognlog* 71+#answers) and 0 ( log 2 n+#answers) 
query time, respectively nan I n St ep [2] of the query procedure we use the 
data structure V* of Subsection 2.2, which uses 0(n\og 2 n) storage and has 
0(log 2 log* n + #answers) query time. The conversion of boxes of aspect ratio a 
to cubes give an additional factor 0 (a 2 ). □ 


3 Objects with small union complexity in the plane 

In the previous section we presented efficient solutions for the case where O 
consists of axis-aligned rectangles. In this section we obtain results for classes 
of constant-complexity objects (which may have curved boundaries) with small 
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union complexity. More precisely, we need that U{n), the maximum union 
complexity of any set of n objects from the class, is small. This is for instance 
the case for disks (where [/(to) = O(m) IlU) and for locally fat objects (where 
[/(to) =TO2°( lo s*"d P3). 

In Step [2] of the query algorithm of the previous section, we performed a 
range query with Oi D Q for each Oi £ O* ( Q ). When we are dealing with arbitrary 
objects, this will be expensive, so we modify our query procedure. 

1. Compute a seed set 0*(Q) C O of objects such that, for any two objects 
Oi,Oj in O intersecting inside Q , both Oi and Oj are in 0*(Q). 

2. Compute all intersecting pairs of objects in the set {oj fl Q : £ 0*(Q)} by 

a plane-sweep algorithm. 

Next we describe how to efficiently find 0*(Q ), which should contain all objects 
intersecting at least one other object inside Q , when the union complexity U ( n ) 
is small. For each object Oi £ O we define o* := Uo eO 1-1 °j) as the un i° n 
of all intersections between Oi and all other objects in O. Let |o*| denote the 
complexity (that is, number of vertices and edges) of o*. 

Lemma 7. £7=1 |o*| = 0{U{n)). 

Proof. Consider the arrangement induced by the objects in O. We define the 
level of a vertex v in this arrangement as the number of objects from O that 
contain v in their interior. We claim that every vertex of any o* is a level-0 or 
level-1 vertex. Indeed, a level-A: vertex for fc > 1 is in interior of more than one 
object, which is easily seen to imply that it cannot be a vertex of any o*. 

Since the level-0 vertices are exactly the vertices of the union of O , the total 
number of level-0 vertices is U ( n ). It follows from the Clarkson-Shor technique [7] 
that the number of level-1 vertices is 0(U(n)) as well. The lemma now follows, 
because each level-0 or level-1 vertex contributes to at most two different o*’s. □ 

Our goal in Step [l] is to find all objects o t such that o* intersects Q. To this end 
consider the connected components of o*. If o* intersects Q then one of these 
components lies completely inside Q or an edge of Q intersects o*. 

Lemma 8. We can find all o* that have a component completely inside Q in 
0(logn + k) time, where k is the number of pairs of objects that intersect inside Q, 
with a data structure that uses 0{U{n)\ogn) storage. 

Proof. For each Oi , take an arbitrary representative point inside each component 
of o*, and store all the representative points in a structure for orthogonal range 
reporting. By Lemma [7] we store 0(U(ri)) points, and so the structure for 
orthogonal range reporting uses O(U(n) log n) storage. 

The query time is 0(logn + t), where t is the number of representative 
points inside Q. This implies the query time is Oflogn + k), because if o* has 
ti representative points inside Q then Oi intersects fiftf) other objects inside Q. 
This is true because the objects have constant complexity, so a single object Oj 
cannot generate more than a constant number of components of o*. □ 
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Next we describe a data structure for reporting all o* intersecting a vertical edge 
of Q\ the horizontal edges of Q can be handled similarly. The data structure is a 
balanced binary tree T, whose leaves are in one-to-one correspondence to the 
objects in O. For an (internal or leaf) node v in T, let T{y) denote the subtree 
rooted at v and let 0 {y) denote the set of objects corresponding to the leaves 
of T[y). Define U{v) := U Oi eO(u)0*. At node v, we store a point-location data 
structure {8| on the trapezoidal map of U(v). (If the objects are curved, then the 
“trapezoids” may have curved top and bottom edges.) 

Lemma 9. The tree T uses 0(U(n)\ogn) storage and allows us to report all 
o* intersecting a vertical edge s of Q in 0((t+ l)log 2 n) time, where t is the 
number of answers. 

Proof. To report all o* intersecting s we walk down T, only visiting the nodes v 
such that s intersects U{v). This way we end up in the leaves corresponding to 
the o* intersecting s. To decide if we have to visit a child v of an already visited 
node, we do a point location with both endpoints of s in the trapezoidal map 
of U(u). Now s intersects U(u) if and only if one of these endpoints lies in a 
trapezoid inside U(y) and/or the two endpoints he in different trapezoids. Thus 
we spend 0(log?z) time for the decision. Since we visit O(fclogn) nodes, the total 
query time is as claimed. 

To analyze the storage we claim that the sum of the complexities of U {v) over 
all nodes v at any fixed height of T is 0(U(n)). The bound on the storage then 
follows because the point-location data structures take linear space [8] and the 
height of T is O(logn). It remains to prove the claim. Consider a node v at a 
given height h in T. Lemma [5] in Appendix |A| proves that each vertex in U(y) is 
either a level-0 or level-1 vertex of the arrangement induced by the objects in 
0{u), or a vertex of o*, for some Oi in 0{y). The proof of the claim then follows 
from the following two facts. First, the number of vertices of the former type 
is 0(U{\0(v)\)), which sums to 0(U(n)) over all nodes at height h. Second, by 
Lemma [7] the number of vertices of the latter type over all nodes at height h 
sums to 0 (U (n)). □ 


Theorem 5. Let O be a set of n constant-complexity objects in the plane from 
a class of objects such that the maximum union complexity of any m objects 
from the class is U(m). Then there is a data structure that uses 0(U(n) log?r) 
storage and that allows us to report for any axis-aligned query rectangle Q, in 
0((k + 1) log 2 n) time all pairs of objects Oi,Oj in O such that Oi intersects Oj 
inside Q, where k denotes the number of answers. 

4 Concluding remarks 

We presented data structures for finding intersecting pairs of objects inside 
a query rectangle. An obvious open problem is whether our bounds can be 
improved. In particular, one would hope that better solutions are possible for 
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3-dimensional boxes, where we obtained 0((k + y/n) polylog n) query time with 
0(riy/nlogn ) storage. (It is possible to reduce the query time in our solution 
to 0 ((k + m ) polylogn), for any 1 ^ m ^ y/n, but at the cost of increasing the 
storage to 0((n 2 /?n) polylog n).) 

Two settings where we have not been able to obtain efficient solutions are 
when O is a set of balls in R 3 , and when O is a set of arbitrary segments in the 
plane. Especially the latter setting seems challenging. Indeed, consider the special 
case where O consist of n/2 horizontal lines and n/2 lines of slope 1. Suppose 
furthermore that the query is a vertical line £ and that we only want to check 
if £ contains at least one intersection. A data structure for this setting could 
be used to solve the following 3SuM-hard problem: given three sets of parallel 
lines, decide if there is a triple intersection pTj. Thus it is unlikely that we can 
obtain a solution with (significantly) sublinear query time and (significantly) 
subquadratic preprocessing time in the setting just described. However, storage 
is not the same as preprocessing time. This raises the following question: is it 
possible to obtain sublinear query time with subquadratic storage? 
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Fig. 4: A possible situation in Case B-3-I. 


A Omitted proofs 

Lemma 2. Let ri, rj be two rectangles in O such that (r^ D rj) n Q ^ 0. Then 
at least one of ri,rj is put into 0*(Q ) by the above query procedure. 

Proof. Let / := (r, n Vj ) fl Q. Each edge of I is either contributed by r ; ; or rj. or 
by Q. Let E{I) denote the set of edges of rj and r 7 - that contribute an edge to I. 
We distinguish two cases, with various subcases. 

Case A: At least one edge e £ E{I) has an endpoint, v, inside Q. Now the 
witness sub-edge on e closest to v must intersect Q and, hence, the corresponding 
rectangle will be put into 0*(Q) in Step[lji). 

Case B: All edges in E(I) cross Q completely. We now have several subcases. 

Case B-l: \E(I)\ < 1. Now Q contributes at least three edges to I, so at 
least one corner of I is a corner of Q. Hence, both rj and rj are put into (D*(Q) 
in Step [ljiii) . 

Case B-2: \E(I)\ ^ 3. Since each edge of E(I) crosses Q completely and 
\E(I)\ ^ 3, both V(Q) and H(Q) are non-empty. Thus at least one of and r :j is 
put into 0*(Q) in Step[ljii). 

Case B-3: \E(I)\ = 2. Let ei and e-i denote the segments in E(I). If one 
of ei,e2 is vertical and the other is horizontal, we can use the argument from 
Case B-2. It remains to handle the case where e\ and e2 have the same orientation, 
say vertical. 

Case B-3-i: Edges e\ and e 2 belong to the same rectangle, say rj, as in Fig. [4} 
If ei has an endpoint, v, inside rj, then e\ has a witness sub-edge starting at v 
that intersects Q, so Ty is put into 0*(Q) in Step[lji). If rj contains a corner of 
Q then rj will be put into 0*(Q ) in Step [TJiii) . In the remaining case the right 
edge of Tj crosses Q and there are vertical edges completely crossing rj (namely 
ei and 62). Hence, the rightmost edge completely crossing r.j, which is a witness 
for rj, intersects Q. Thus rj is put into 0*{Q) in Step [lji) . 

Case B-3-ii: Edge e\ is an edge of r,; and e2 is an edge of rj (or vice versa). 
Assume without loss of generality that the y-coordinate of the top endpoint of e\ 
is less than or equal to the y-coordinate of the top endpoint of e2- Then the top 
endpoint, v, of e\ must lie in rj, and so e\ has a witness sub-edge starting at v 
that intersects Q. Hence, r, t is put into 0*(Q) in Step [lji) . □ 
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(a) Case A in the proof 
of Lemma [5] 


(b) Case B in the proof 
of Lemma [5] 


Fig. 5: Different cases in the proof of Lemma [5J To simplify the presentation we 
assumed the objects are disks, o* and o* are surrounded by dark green and dark 
red, respectively. Regular arcs are in solid and irregular arcs are in dashed. The 
blue vertex refers to vertex u in the proof. 


Lemma 4. Using a data structure of size 0(ny/n\ogn) we can find in time 
0 (log n log* n + k) a seed set O* ( Q ) of 0 (y/n + k ) boxes containing at least one 
box from every intersecting pair of Type (ii), where k is the number of such pairs. 

Proof. The Type (ii) intersections hi (~l bj either have a vertex that is a vertex of 
bi or bj inside Q , or they have an edge-face pair intersecting inside Q. To find 
seed objects for the former pairs we used O(nlogn) storage and 0(lognlog* n + 
^answers) query time, and we put O(k) boxes into the seed set. For the latter 
pairs, we used an approach based on clusters. For each cluster F, we have a 
data structure V{E i ) that uses 0(n log n) storage, giving 0(n^/n log n) storage 
in total. Besides the O(yfn) boxes in the two clusters Fi and Fj. we put boxes 
into the seed set for the clusters F t with i < t < j, namely when querying the 
data structures T>{Ei). This means that the same box may be put into 0*(Q) up 
to y/n times. (Note that these duplicates are later removed.) However, each copy 
we put into the seed set corresponds to a different intersecting pair. Together 
with the fact that the query time in each T>(E t ) is 0(logrclog* n + ^answers) 
this means the total query time and size of the seed set are as claimed. □ 


Lemma 5. Each vertex in Lt(v) is either a level-0 or level-1 vertex of the ar¬ 
rangement induced by the objects in 0(y), or a vertex of o*, for some Oi in 
0 {v). 

Proof. Define 0*(u) := {o* : Oi £ 0(u)}. Any vertex u of U(y) that is not 
a vertex of some o* £ 0 *{v) must be an intersection of the boundaries of some 
o*, o* £ 0{v). Note that the boundary do* of an object o* consists of two types 
of pieces: regular arcs , which are parts of the boundary of Oi itself, and irregular 
arcs , which are parts of the boundary of some other object Ok- To bound the 
number of vertices of U(u) of the form do* n do* we now distinguish three cases. 


Finding Pairwise Intersections Inside a Query Range 


15 


Case A: Intersections between two regular arcs. In this case u is either a 
level-0 vertex of the arrangement defined by 0 (v) (namely when u is contained 
in no other object Ok £ O(v)), or a level-1 vertex of that arrangement (when u is 
contained in a single object Ok £ 0{v)). Note that u cannot be contained in two 
objects from 0 {y), because then u would be in the interior of some o* k £ 0 *{v ), 


contradicting that u is a vertex of U(y). See Fig 5a 


Case B: Intersections between a regular arc and an irregular arc. Without 
loss of generality, assume that u is the intersection of a regular arc of do* and 
an irregular arc of do*. Note that this implies that u lies in the interior of Oj. If 
there is no other object Ok £ O containing u then it would be a vertex of o*, and 
if there is at least one object Ok £ O containing u then u would not lie on do*. 
So, under the assumption that u is not already a vertex of o*, Case B does not 
happen. See Fig [5b] 

Case C: Intersections between two irregular arcs. In this case u lies in the 
interior of both Oi and Oj. But then u should also be in the interior of o* and o*, 
so this case cannot happen. □ 



